S1: Methods description and relationships between module preservation statistics

Authors

  • Peter Langfelder
  • Luo Rui
  • Michael C. Oldham
  • Steve Horvath
Abstract

Here, we provide additional methodological details regarding the module preservation statistics. In the first section, we describe standard cross-tabulation based module preservation statistics. Specifically, we present three basic cross-tabulation based statistics for determining whether modules in a reference data set are preserved in a test data set. These statistics do not assume that a test network is available. Instead, module assignments in both the reference and the test networks are needed. In the second section, we briefly review a hierarchical clustering procedure for module detection. Many methods exist for defining network modules. In this section, we describe the method used in our applications, but it is worth repeating that our preservation statistics apply to most alternative module detection procedures. In the third section, we review the definition of signed and unsigned correlation networks. Correlation networks are a special case of general undirected networks in which the adjacency is constructed on the basis of correlations between quantitative variables. In the fourth section, we present module quality statistics that are implemented in the modulePreservation R function. While our main article focuses on statistics that measure preservation of modules between a reference and a test network, we briefly discuss the application of some of the preservation statistics to the related but distinct task of measuring module quality in a single (reference) network. More precisely, the density and separability statistics can be applied to the reference network without reference to a test network. The results can then be interpreted as measuring module quality, that is, how closely interconnected the nodes of a module are or how well a module is separated from other modules in the network. In the fifth section, we review the notation for the singular value decomposition and for defining a module eigennode. The section describes conditions under which the eigenvector E is an optimal way of representing a correlation module. It also reviews the definition of propVarExpl (the proportion of the variance explained by the eigennode). We derive a relationship between propVarExpl and the module membership measures kME, which will be useful for deriving relationships between preservation statistics. In the sixth section, we investigate relationships between preservation statistics in correlation networks. An advantage of an (unsigned) weighted correlation network is that it allows one to derive simple relationships between network concepts [1, 2]. We characterize correlation modules where simple relationships exist between i) density-based preservation statistics, ii) connectivity-based preservation statistics, and iii) separability-based preservation statistics. Apart from studying relationships among preservation statistics in correlation networks, we also briefly describe relationships between preservation statistics in general networks. In the seventh section, we briefly review the In-Group Proportion method [3].
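
To make the link between propVarExpl and kME concrete, the following is a minimal Python sketch, not the authors' R implementation. It assumes one standard identity that may or may not be the exact form derived in the supplement: when every node profile in a module is standardized to mean 0 and variance 1, the proportion of variance explained by the first singular vector equals the mean of the squared kME values. The function name `eigennode_summary` and the simulated module are hypothetical and serve only as an illustration.

```python
import numpy as np

def eigennode_summary(x):
    """Hypothetical helper. x is a samples-by-nodes array for one module.

    Returns (eigennode, kME, propVarExpl).
    """
    # Standardize each node profile (column) to mean 0 and variance 1.
    z = (x - x.mean(axis=0)) / x.std(axis=0, ddof=1)
    # Singular value decomposition z = U diag(d) V^T.
    u, d, _ = np.linalg.svd(z, full_matrices=False)
    # The eigennode is the first left singular vector (its sign is arbitrary).
    eigennode = u[:, 0]
    # kME: correlation between each node profile and the eigennode.
    kme = np.array(
        [np.corrcoef(z[:, i], eigennode)[0, 1] for i in range(z.shape[1])]
    )
    # Proportion of variance explained by the eigennode.
    prop_var_expl = d[0] ** 2 / np.sum(d ** 2)
    return eigennode, kme, prop_var_expl

# Simulated module (assumption): 20 nodes that share one latent profile plus noise.
rng = np.random.default_rng(0)
n_samples, n_nodes = 50, 20
seed_profile = rng.normal(size=n_samples)
module = 0.8 * seed_profile[:, None] + 0.6 * rng.normal(size=(n_samples, n_nodes))

eigennode, kme, prop_var_expl = eigennode_summary(module)
mean_kme_sq = np.mean(kme ** 2)
print(prop_var_expl, mean_kme_sq)  # the two values agree up to rounding error
```

Squaring kME removes the dependence on the arbitrary sign of the singular vector, which is why the comparison uses the mean of kME squared rather than the mean of kME.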

Similar resources

Is My Network Module Preserved and Reproducible?

In many applications, one is interested in determining which of the properties of a network module change across conditions. For example, to validate the existence of a module, it is desirable to show that it is reproducible (or preserved) in an independent test network. Here we study several types of network preservation statistics that do not require a module assignment in the test network. W...

Module contractibility for semigroup algebras

In this paper, we find the relationships between module contractibility of a Banach algebra and its ideals. We also prove that module contractibility of a Banach algebra is equivalent to module contractibility of its module unitization. Finally, we show that when a maximal group homomorphic image of an inverse semigroup S with the set of idempotents E is finite, the module projective tensor product l1...

S6: Comparison studies on simulated data

In this document we illustrate the module preservation statistics on simulated data. We describe the simulation method and seven simulation studies in more detail than in the main text. The design and main results of the simulations are summarized in Figure 8 of the main article which we reproduce here for convenience in Figure 1. A complete table of results can be found in the accompanying Sup...

A scalable permutation approach reveals replication and preservation patterns of gene coexpression modules

Gene coexpression network modules provide a framework for identifying shared biological functions. Analysis of topological preservation of modules across datasets is important for assessing reproducibility, and can reveal common function between tissues, cell types, and species. Although module preservation statistics have been developed, heuristics have been required for significance testing. ...

Metadata Enrichment for Digital Preservation

Description of structural and semantic relationships and properties of, within, and between resources is seen as a key issue in digital preservation. But the markup languages used to encode descriptions for migration between and storage within digital repositories are subject to the same interpretive problems that complicate other uses of markup. This paper reports on a project that aims to add...

Publication date: 2010